Leveraging Biological Identifier Relationships and Related Docu- ments to Enhance Information Retrieval for Proteomics
نویسندگان
چکیده
Leveraging Biological Identifier Relationships and Related Documents to Enhance Information Retrieval for Proteomics Andrew Smith, Kei Cheung, Michael Krauthammer, Martin Schultz and Mark Gerstein Department of Molecular Biophysics and Biochemistry, Department of Computer Science, Center for Medical Informatics, Department of Genetics, Program in Computational Biology and Bioinformatics, Department of Anesthesiology, Department of Pathology, Yale University, New Haven, CT USA Corresponding Author
منابع مشابه
Leveraging the structure of the Semantic Web to enhance information retrieval for proteomics
MOTIVATION Proteomics researchers need to be able to quickly retrieve relevant information from the web and the biomedical literature. To improve information retrieval, we leverage the structure of the semantic web, developing an approach for joining it with the largely opposing paradigm of unsupervised web search. RESULTS Our approach uses a Resource-Description-Framework (RDF) graph that in...
متن کاملLearning Ranking Functions for Geographic Information Retrieval Using Genetic Programming
Geographic Information Retrieval (GIR) has emerged as a new and promising tool for representation, storage, organisation of and access to geographic information. One of the current issues in GIR research is ranking of retrieved documents by both textual and geographic similarity measures. This paper describes an approach that learns GIR ranking functions using Genetic Programming (GP) methods b...
متن کاملLinkHub: a Semantic Web System for Efficiently Handling Complex Graphs of Proteomics Identifier Relationships that Facilitates Cross-database Queries and Information Retrieval
Background: A key abstraction in representing proteomics knowledge is the notion of unique identifiers for individual entities (e.g. proteins) and the massive graph of relationships among them. These relationships are sometimes simple (e.g. synonyms) but are often more complex (e.g. one-to-many relationships in protein family membership). Results: We have built a software system called LinkHub ...
متن کاملInformation Retrieval Using Statistical Classification
In the classical information retrieval IR problem the system must nd all docu ments in a collection that are related to a topic de ned by a user s query A common approach to the IR problem is to represent documents and the query as vectors of term frequencies and rank the documents in the collection according to their inner product similarity with respect to the query When a sample of evaluated...
متن کاملEvaluating Lexicon Coverage for Cross-Language Information Retrieval
Cross language retrieval systems use queries ex pressed in one natural language to retrieve docu ments that may be written in a di erent language Term by term translation techniques based on bilin gual lexicons are now widely used for cross language retrieval but little is known about the way in which retrieval e ectiveness varies with dictionary cover age This paper compares three coverage mea...
متن کامل